Computing the Quartet Distance Between Trees of Arbitrary Degrees
نویسندگان
چکیده
Comparing trees with regard to their topology is in itself an interesting theoretical problem in computer science, and furthermore researchers working in the interdisciplinary field of computational biology need tools to compare phylogenetic trees, i.e. trees that describe the relation of species according to evolutionary history. Different methods and different information can result in different phylogenetic trees, and consequently there is a need to be able to compare such trees. Comparison of trees can be done by calculating the distance between them, and among the distance measures usable on trees are the quartet distance. A quartet is a set of four leaves in a tree, and the edges in the tree connecting the leaves imply the topology of the quartet. The quartet distance between two trees containing the same leaves is the number of quartets containing the same four leaves that have different topology in the two trees. Previous algorithms focus on calculating the quartet distance between binary trees. We explore different approaches for calculating the quartet distance between trees of arbitrary degrees. Each approach gives rise to one or two algorithms with varying running times and space consumptions. The running times are verified experimentally and a possibility for reducing the space consumption of the fastest algorithm is discussed. We have implemented the fastest algorithm in a tool, which is also presented, along with the feature of visualizing the similarity of trees using the quartet distance.
منابع مشابه
Computing the Quartet Distance Between Trees of Arbitrary Degree
We present two algorithms for computing the quartet distance between trees of arbitrary degree. The quartet distance between two unrooted evolutionary trees is the number of quartets—sub-trees induced by four leaves—that differs between the trees. Previous algorithms focus on computing the quartet distance between binary trees. In this paper, we present two algorithms for computing the quartet ...
متن کاملAlgorithms for Computing the Quartet Distance between Trees of Arbitrary Degree
We present two algorithms for computing the quartet distance between trees of arbitrary degree. The quartet distance between two unrooted evolutionary trees is the number of quartets—sub-trees induced by four leaves—that differs between the trees. Previous algorithms focus on computing the quartet distance between binary trees. In this paper, we present two algorithms for computing the quartet ...
متن کاملEfficient algorithms for computing the triplet and quartet distance between trees of arbitrary degree
The triplet and quartet distances are distance measures to compare two rooted and two unrooted trees, respectively. The leaves of the two trees should have the same set of n labels. The distances are defined by enumerating all subsets of three labels (triplets) and four labels (quartets), respectively, and counting how often the induced topologies in the two input trees are different. In this p...
متن کاملAlgorithms for Computing the Quartet Distance
Evolutionary (Phylogenetic) trees are constructs of the biological and medical sciences, their purpose is to establish the relationship between a set of species (phyla). Often it is the case that the true evolutionary tree is unknown and one can only try to estimate it. Reconstruction methods are manifold and the resulting evolutionary trees are not guaranteed to be correct. In order to establi...
متن کاملOn the Scalability of Computing Triplet and Quartet Distances
In this paper we present an experimental evaluation of the algorithms by Brodal et al. [SODA 2013] for computing the triplet and quartet distance measures between two leaf labelled rooted and unrooted trees of arbitrary degree, respectively. The algorithms count the number of rooted tree topologies over sets of three leaves (triplets) and unrooted tree topologies over four leaves (quartets), re...
متن کامل